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Abstract 

Cell-free RNA and protein synthesis (CFPS) Is becoming Increasingly used for protein production as yields increase and costs 
decrease. Advances In reconstituted CFPS systenns such as the Protein synthesis Using Recombinant Elements (PURE) system 
offer new opportunities to tailor the reactions for specialized applications including in vitro protein evolution, protein 
microarrays, Isotopic labeling, and incorporating unnatural amino acids. In this study, using firefly luciferase synthesis as a 
reporter system, we improved PURE system productivity up to 5 fold by adding or adjusting a variety of factors that affect 
transcription and translation. Including Elongation factors (EF-Ts, EF-Tu, EF-G, and EF4), ribosome recycling factor (RRF), 
release factors (RFl, RF2, RF3), chaperones (GroEL/ES), BSA and tRNAs. The work provides a more efficient defined in vitro 
transcription and translation system and a deeper understanding of the factors that limit the whole system efficiency. 
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Introduction 

As an alternative to in vivo protein syntliesis, CFPS systems 
provide the ability to produce a variety of compounds and present 
numerous advantages: concentrations of some system components 
can be controlled, dierefore a large parameter space can be 
studied [1,2]; toxic products to a host cell, such as membrane 
proteins, can be produced with better yields [3]. Due to the high 
engineering flexibility of CFPS, they are applied to high- 
throughput methodologies [4-6], non-standard amino acids 
incorporation into proteins, and in vitro protein evolution [7] . 

Cell-free translation has been achieved by two approaches. One 
approach is based on crude cell extract, derived from Escherichia 
coli, wheat germ or rabbit reticulocytes [8-12]. Cell-free systems 
based on crude extracts have been optimized for long-lived 
synthesis and high yield [13,14], but their range of applications is 
still limited by their complicated nature. Issues include indepen- 
dent of peptide bond formation [15], and template nucleic acids or 
protein products degradation by nucleases or proteases [16]. The 
other approach is reconstituting well-defined protein synthesis 
systems from recombinant factors. The "Protein synthesis Using 
Recombinant Elements" (PURE) system is a partially recombi- 
nant, CFPS system reconstituted solely from elements essential to 
E. coli translation [17]. The PURE system does not contain some 
of the detrimental enzymes found in extracts. It provides higher 
reaction controllability in comparison to crude extract-based 
CFPS systems for translation studies and biotechnology applica- 
tions [18]. Table 1 systematically compares the PURE system with 
a commercialized E. coli crude extract CFPS (the 5 PRIME RTS 
system). The ribosome concentration is ~2.4 |a.M in the PURE 
system and ~ 1 .6 |J,M in the crude extract system. The PURE 
system's productivity as measured by the yield of active firefly 
luciferase is 3 fold lower [19] while its cost is ~4 times higher per 



gram of protein produced than those of the crude extract system. 
Additionally, the PURE system's preparation procedure, which 
involves multiple column based purifications, is more labor and 
time consuming than the preparation of crude extract system. To 
simplify the preparation procedure, our group applied the 
Multiplex Automated Genome Engineering (MAGE) method to 
insert hexa-histidine sequences into 38 essential genes in vivo that 
encode the entire translation machinery of £. coli, and produced a 
streamlined, co-purified, and reconstituted PURE system in vitro 
that is about 1 1 % as active as a commercial system (New England 
Biolabs) [20]. Therefore developing a cost-effective PURE system 
with higher productivity is necessary. 

Ignoring energy supply and small molecule metabolism, the 
most common focus of CFPS enhancement efforts, several factors 
have been identified that improve E. coli crude extract CFPS yield 
and stability. It has been shown [21] that adding purified 
elongation factors (EEs) to an E. coli crude extract CFPS (a 
modified PANOx-SP system [22]) increases protein synthesis rates 
and yields by increasing both translation initiation and elongation 
rates. Multiple groups have identified improvements by studying 
translation dynamics in a simplified PURE system that lacks 
transcription and tRNA synthesis [23-25]. For instance, Pavlov, 
M. Y etc. reported that ribosome recycling times are minimized 
when RFl concentrations are slightiy smaller than the total 
ribosome concentration in an in vitro translation system [23]. 
Chaperone systems (e.g., GroEL/ES and DnaK/DnaJ/GrpE) 
were shown to alleviate protein aggregation when added to the 
PURE system [26-28], but their effects on the yield of different 
functional proteins are yet to be characterized. Recentiy identified 
Elongation Factor 4 (EF4), which induces back-translocations in 
ribosomes that have experienced defective translocations, was also 
shown to affect E. coli crude extract CFPS productivity [29]. 
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Table 1. Comparison of PURE system with RTS system {£ coli crude cell extract based cell-free system). 







5 PRIME RTS System (f. co// crude cell extract) 


PURE System 


Productivity 


2—20 reaction 


u.D— 1 u |ig/DU jiL reaciion 


System composition 


Undefined, proteases and nucleases and tmRNA exist 


Defined 


Ribosome concentration 


-1.6 


-2.4 fiM 


Engineering flexibility 


low 


high 


Linear template tolerance 


low 


high 


Preparation 


Centrifuge based, quick and simple 


Multiple column based purifications, labor intensive and time 
consuming 


Price 


$1.2-124ig protein 


$4.4-88/|ig protein 


Translation efficiency 


higher 


lower 



System productivity and ribosome concentration information were all obtained from 5 PRIME RTS system and PURE system handbook. 
doi:l 0.1 371 /journal.pone.Ol 06232.t001 



Compared to cytosol, which has a protein concentration of 200- 
300 g/L and a RNA concentration of 75-150 g/L, cell-free 
systems are very dilute. Crowding agents PEG-8000, Ficoll-70 and 
Ficoll-400, when added to cell-free systems at certain concentra- 
tions, have been found to boost transcription but inhibit 
translation [30]. 

Notably, the non-energy system factors described above have 
been explored mainly one at a time in a variety of CFPS systems, 
and little effort has been spent trying to consolidate these 
improvements within a single system. Here, using firefly luciferase 
(Flue) as a reporter, we optimized the commercialized PURE 
system productivity by adjusting several factors individually and in 
combination. We report that we were able to boost the yield of 
functional Flue between 25% and 70% through increases of EF- 
Ts, EF-Tu and EF-G concentrations; adjustments to ribosome 
recycling factor (RRF) and release factor [31] 1, 2, 3 concentra- 
tions; addition of EF4, GroEL/ES, and BSA; and increasing 
tRNA concentration; these factors taken individually or in small 
subsets. By combining all of our individual optimizations, we 
achieved a >5 fold increase in PURE system productivity as 
measured by the yield of functional Flue. We further investigated 
these factors on the synthesis of a fluorescence protein mCherry 
and an E. coli protein P-galactosidase (P-gal) in the PURE system 
and achieved ~3.2 and ~2.5 fold active product yield, respec- 
tively. 

Results 

Improving PURE system translation by adding translation 
factors 

The PURE system contains IFl, IF2, IF3, EF-Tu, EF-Ts, EF-G, 
RFl, RF3, RRF, 20 aminoacyl-tRNA synthetases (ARSs), 
methionyl-tRNA transformylase (MTF), T7 RNA polymerase, 
ribosomes, 46 tRNAs, NTPs, creatine phosphate, 10-formyl- 
5,6,7,8-tetrahydrofolic acid, 20 amino acids, creatine kinase, 
myokinase, nucleoside-diphosphate kinase, and pyrophosphatase 
[17,32]. Transcription and translation in PURE system are two 
consecutive processes. We investigated the overall outcome of the 
two processes as measured by functional Flue produced under a 
variety of conditions using a pFVEX 2.3d-Fluc plasmid harboring 
Flue gene under T7 promoter and terminator control. Reactions 
were conducted for 2 hours at 37°C. The amount of functional 
Flue produced was measured in relative luminescence units by a 
microplate reader. 



We first increased the concentration of EEs which are already 
present in the system and found translation was enhanced by the 
addition of EF-Tu, Ts and G (Figure la). Over a series of stepwise, 
proportional additions of EF-Tu, Ts and G, we observed a 
maximum increase of ~60% functional Flue when concentrations 
of EF-Tu, Ts and G were simultaneously increased by 34.85 |J.M, 
6.25 nM and 3.89 \lM respectively. 

The newly identified EF4 gene lepA was cloned from E. coli 
strain MG1655 to a pET-24b vector with a C-terminal hexa- 
histidine (his) tag and overexpressed in BL21 (DE3) E. coli cells. 
EF4 was then purified with AKTAprime (GE Healthcare) 
equipped with 5 mL HisTrap HP column and analyzed on a 4— 
12% Bis-Tris PAGE gel (Figure SI). The electrophoretic pattern 
indicated that EF4 was purified to homogeneity. As increased 
amounts of EF4 were added to PURE system, the total functional 
Flue amount increased and peaked at a ratio of 0.075 molecules 
EF4 added per 70 S. At the peak, EF4 improved PURE 
translation by ~30% in terms of the yield of functional Flue 
(Figure lb). Further addition of EF4 led to a rapid reduction in the 
functional Flue production, in agreement with the effects of EF4 in 
E. coli crude extract based CFPS [29]. 

We also increased RRF, RFl, 2, 3 concentrations by 2 |xM, 
5 (iM and 8 (iM. Compared to the original commercialized 
PURE system, we observed a maximum boost of ~55% in 
functional Flue yield when the concentrations were increased by 
2 |XM (Figure Ic). Further increase of RFs and RRF concentra- 
tions significantly decreased yield (Figure Ic). 

To test whether adding translation factors has similar effects on 
the synthesis of other proteins, we constructed another two 
reporter proteins mCherry and fi-gal into a pET-24b vector under 
T7 control and tested their expression in PURE system at different 
concentrations of translation factors. A maximum increase of 
~60% in active mCherry yield was observed when concentrations 
of EF-Tu, Ts and G were simultaneously increased by 17.44 |a,M, 
3.13 nM and 1.95 \lM respectively (Figure S2a). For P-gal, a 
maximum increase of ~28% in active product yield was reached 
when EF-Tu, Ts and G concentrations were simultaneously 
increased by 8.72 |xM, 1.56 ^M, and 0.98 |J,M respectively (Figure 
S3a). At the optimized EF-Tu, Ts and G concentrations for Flue 
synthesis, mCherry showed a boost of ~30% in active product 
yield while fi-gal showed a similar level of active product yield 
compared to the original PURE system (Figure S2a and S3a). 
When EF4 was added to the system the total active product yield 
for mCherry and fi-gal increased and also peaked at a ratio of 
0.075 molecules EF4 added per 70S. At the peak, it exliibited a 
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Figure 1. Optimization of PURE system as measured by active Flue produced by supplementing different concentrations of EF-Tu, 
Ts, G; EF4; RF1, 2, 3 and RRF. (a). Active Flue produced at different EF-Ts, Tu and G concentrations. The table below shows the actual 
concentration increase of EF-Ts, Tu and G in the PURE system, (b). Active Flue produced at different EF4 concentrations, (c). Active Flue produced at 
different RF1, 2, 3 and RRF concentrations. The table below shows the actual concentration increase of RF1, 2, 3 and RRF in the PURE system. Flue 
activities were measured in relative luminescence unit by luciferase assay and PURE system reaction without supplement was set as control. Error bars 
are ± standard deviations, with n = 3. 
doi:1 0.1 371 /journal.pone.01 06232.g001 



~2 fold active product yield for mCherry and a ~1.4 fold active 
product yield for P-gal compared to the original PURE system 
(Figure S2b and S3b). We also tried increasing the concentrations 
of RFl, RF2, RF3 and RRF when expressing mCherry and P-gal 
in the PURE system. The yield of active mCherry was maximally 
increased by 60% when RFl, RF2, RF3 and RRF concentrations 
were increased by 2 |iM (Figure S2c). The yield of active P-gal was 
boosted maximally by -40% when RFl, RF2, RF3 and RRF 
concentrations were increased by I |xM (Figure S3c). 



PURE system transcription and translation under 
macromolecular crowding conditions 

The PURE system comprises a relatively dilute solution that 
does not benefit from the macromolecular crowding effects present 
in living cells. We therefore added BSA to increase the overall 
PURE system protein concentration. Adding BSA steadily 
increased functional Flue production and reached a maximum 
boost of -70% at 15.5 |xM (Figure 2a). Further addition of BSA 
led to a steady reduction in the functional Flucyield. Then we 
decoupled PURE system transcription and translation by 
performing the PURE reaction without ribosomes and tRNAs to 
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study BSA's crowding effect on transcription only. Figure 2b Improving PURE System translation by adding chaperone 

shows the Flue mRNA yields measured with the Quant-iT systems 

RiboGreen RNA reagent. The time course (kinetics) of transcrip- Chaperone systems such as DnaK/DnaJ/GrpE and GroEL/ 

tion shows that in the presence of 15.5 nM BSA, transcription q^^^^ understood to contribute to the folding of newly 
proceeded faster than in the original PURE system with a 20% synthesized polypeptides, either interacting with these at the 
increase in the initial transcription velocity (Figure 2b). Adding ribosome or shortly after their release [26,28,34]. Niwa etc. 
another macromolecular crowding agent PEG-6000 decreased comprehensively evaluated the effects of GroEL/GroES and 
functional Flue yield (Figure 2c) possibly because macromolecular DnaK/DnaJ/GrpE on -800 aggregation-prone cytosohc E. coli 
crowding facilitated protein aggregation rather than correct proteins using PURE system and found DnaK and GroEL systems 
folding of nascent proteins [30,33] . drastically increased the solubility of hundreds of proteins [35] . A 

We also tested BSA's macromolecular crowding effects on reasonable inference is that adding such chaperones to the basic 
mCherry and P-gal synthesis in the PURE system and observed a puRE system should not only enhance protein folding and 
maximum active product yield of ~2 fold for mCherry at 15.5 hM solubility but increase functional product yield, and that this 
BSA and -1.2 fold for P-gal at 5.1 ^iM BSA (Figure S4). should obtain for exogenous as weU as native E. coli proteins. A 

previous study showed that DnaK system enhanced the binding 
activity of a single-chain antibody synthesized in the PURE system 
while the GroEL/ES system showed little effect on that [36]. 

a. b. 




• Control 

■ ♦IS.SiiMBSA 



+ PEG 6000. % (w/v) 



Figure 2. Optimization of PURE system as measured by functional Flue produced by adding macromolecular crowding agents, (a). 

Active Flue produced in the system at different BSA concentrations, (b). The time course (kinetics) of transcription measured by Flue mRNA yields 
with the Quant-iT RiboGreen RNA reagent with and without the presence of 15.5 BSA. (c). Active Flue produced in the system at different PEG- 
6000 concentrations. In (a) and (e) Flue activities were measured in relative luminescence units by luciferase assay and PURE system reaction without 
supplement was set as control. Error bars are ± standard deviations, with n = 3. 
doi:1 0.1 371/journal.pone.01 06232.g002 
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Here we investigated the effect of adding these systems to PURE 
system on fijnctional Flue yield. Functional Flue yield increased 
steadily as more GroEL/ES was added and reached a maximum 
increase of -60% with 4 GroEL/ES (Figure 3a). GroEL/ES 
also boosted active P-gal yield by ~20% at a concentration of 
4 |i.M (Figure S7a). However, it showed little positive effect on 
active product yield when applied to mCherry synthesis (Figure 
S6a). We also assessed total protein amount of Flue produced at 
each concentration of GroEL/ES via SDS-PAGE (Figure S5). 
Since Flue does not overlap in a Coomassie blue stained gel with 
any other protein present in the PURE system, the Flue band can 
be scanned and an accurate determination of the total amount can 
be made. Interestingly, adding GroEL/ES also boosts the total 
amount of protein produced. When DnaK/DnaJ/ GrpE system 
was added, it showed litde positive effect on the yield of functional 
Flue, mCherry and P-gal (Figure 3b, Figure S6b and S7b). 

Improving PURE system transcription and translation by 
adjusting tRNA, Mg^+ and ATP/GTP concentrations 

We then studied the effects of adjusting tRNA, Mg"^ and ATP/ 
GTP concentrations on PURE system protein synthesis with Flue 
as the reporter. Here we incorporated adjustments to Mg ^ and 
ATP/GTP into versions of the PURE system that already 
included previously determined optimizations to EF-Ts, Tu, G, 
EF4, RF 1, 2, 3, RRF, GroEL/GroES and BSA for Flue. 

When we followed this same procedure for testing adjusted 
tRNA concentrations, however, we included an additional 
combination of EF-Ts and Tu concentrations as well, where these 
were lower than the previously optimized values, holding the 
optimized concentrations of all other factors constant. Lowering 
EF-Ts and Tu concentrations in this way decreased the yield of the 
optimized system. Increasing tRNA concentration by 56 
A260 units/mL in the system gave a 25% increase in functional 
Flue yield for both concentration sets of EF-Ts, Tu (Figure 4a). 
Surprisingly, after combining the optimized concentrations of 
protein factors, increasing tRNA concentration by 56 A260 units/ 



mL decreased the active product yield of mCherry and P-gal 
(Figure S8). 

We first tested the effect of Mg"'*^ concentration on the original, 
vmoptimized, PURE system. Adding more Mg^^ drastically 
decreased PURE system productivity as measured by active Flue 
produced (Figure 4b). However, when we increased Mg^"*" 
concentration by 2 mM and 4 mM and also increased ATP or 
GTP concentration by 2 mM at each concentration of Mg^^ in 
our optimized PURE system, we did not observe any significant 
changes in the yield of functional Flue (Figure 4c). This suggests 
that our optimized PURE system exhibits reduced sensitivity to 
Mg^"*^ compared to the original PURE .system. 

Assessment of improved PURE system by combining the 
optimized conditions 

In the previous sections, we have optimized PURE system by 
individually assaying the protein factors and tRNA concentrations. 
Here we combined the individually optimized concentrations of 
EEs, RFs, RRF, GroEL/ES, BSA and tRNA and eventuaUy 
achieved >5 fold increase in PURE system productivity as 
measured by the yield of functional Flue compared to the original 
PURE system (Figure 5). By combining the individually optimized 
concentrations of EEs, RRF, RFs, and BSA for mCherry, we 
achieved a ~3.2 fold active product yield (Figure S8a). When the 
individually optimized concentrations of EEs, RRF, RFs, GroEL/ 
ES and BSA for P-gal were combined, a ~2.5 fold active product 
yield was achieved (Figure S8b). 

Discussion 

In this study, we added or adjusted the concentration of a 
variety of factors that affect transcription and translation and 
achieved a boost in PURE system productivity of more than 5 fold 
as measured by functional Flue produced. The work provides a 
deeper understanding of cell-free system efficiency limitation 
factors. For both E. coli crude extract based cell-free system [21] 
and reconstituted PURE system, adding more elongation factors 
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Figure 3. Optimization of PURE system as measured by functional Flue produced by adding chaperone systems GroEL/ES and 
DnaK/DnaJ/GrpE. (a). Active Flue produced at different GroEL/GroES concentrations, (b). Active Flue produced at different DnaK/DnaJ/GrpE 
concentrations. Flue activities were measured In relative luminescence unit by luclferase assay and PURE system reaction without supplement was set 
as control. Error bars are ± standard deviations, with n = 3. 
dol:1 0.1 371/journal.pone.01 06232.g003 
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Figure 4. Optimization of PURE system as measured by functional Flue produced by adjusting tRNA, ATP and GTP concentrations, 
(a). Increasing tRNA concentration by 56 A260 units/ml boosts functional Flue yield by 25%. The table below shows the actual concentration increase 
of EF-Ts, Tu, G and tRNA in each reaction. Reaction 1 is taken as control. In reaction 2, 3, 4 and 5, EF4; RF 1, 2, 3, RRF; GroEL/GroES and BSA were added 
at their optimized concentrations, (b). Increasing Mg^* concentration decreases functional Flue yield in the original PURE system, (c). Increasing 
Mg^*, ATP and GTP concentrations has little effect on final yield of functional Flue in our optimized PURE system with optimized concentrations of EF- 
Ts, Tu, G; EF4; RF 1,2, 3, RRF; GroEL/GroES and BSA. Flue activities were measured in relative luminescence unit by luciferase assay and PURE system 
reaction without supplement was set as control. Error bars are ± standard deviations, with n = 3. 
doi:10.1371/journal.pone.0106232.g004 



can boost the system productivity (Figure la) which indicates that 
system efficiency is limited by translation elongation capacity. In 
fact under rapid growth conditions, EF-Tu is the most abundant 
protein not only in E. coli but in most bacterial cells and reaches 
~10 times of the ribosome concentration to accelerate protein 
synthesis required by fast growth [37,38]. The PURE system 
contains ~2.4 |xM ribosome. Adding EF-Tu to the system makes 
the EF-Tu to ribosome ratio more close to the ratio in vivo during 
rapid growth phase so that higher protein synthesis efficiency can 
be achieved. Moreover the concentrations of RFl, 2, 3 and RRF 
in the original PURE system are —0.25 ^iM, —0.24 ^M, 



—0.17 |J,M and —0.485 |a.M, respectively. When we increased 
RFl, 2, 3 and RRF concentrations by 2 |a.M, the final 
concentrations of RFl, 2, 3 and RRF concentrations were close 
to the ribosome concentration 2.4 |j,M in PURE system. These 
steps presumably accelerated ribosome recycling by minimizing 
ribosome recycling time [23,24] and resulted in a 55% increase in 
active Flue yield (Figure Ic), suggesting that ribosome release and 
recycling are also limiting factors for PURE translation. It has also 
been reported that after the termination step of translation, the 
post-termination complex, composed of the ribosome, mRNA, 
and a deacylated tRNA, is processed by the concerted action of 
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Figure 5. Optimization of PURE system with tKie best combination of EF-Ts, Tu, G; EF 4; RF 1,2,3, RRF; GroEL/GroES; BSA and tRNA 
concentrations as measured by functional Flue produced. Flue activities were measured in relative luminescence unit by luciferase assay and 
PURE system reaction without supplement was set as control. Error bars are ± standard deviations, with n = 3. 
doi:1 0.1 371 /journal.pone.01 06232.g005 



RRF, EF-G, GTP and probably IF3 to prepare the ribosome for a 
fresh round of protein synthesis [39]. We tested tliis effect on Flue 
synthesis by increasing the concentrations of these three factors by 
2 |XM and eventually achieved a ~30% boost in active Flue yield 
(Figure S9a). Surprisingly, when we only increased IFl, 2, 3 
concentrations by 2 |J,M and 4 |xM, Flue translation was severely 
inhibited (Figure S9b). 

Another interesting finding is that supplementing chaperone 
system GroEL/ES not only improved the yield of functional Flue 
(Figure 3a) but also increased the total amount of Flue expressed 
(Figure S5). GroEL/ES also increased the amount of active P-gal 
produced but had little positive effect on active mCheny synthesis 
(Figure S6a and S7a). In contrast, the DnaK/DnaJ/GrpE system 
did not show much positive effect on the synthesis of all of the 
three proteins (Figure 3b, Figure S6b and S7b). This finding is 
different from the previous result which saying the DnaK system 
enhanced the expression of functional single-chain antibody but 
the GroEL/ES system didn't [36]. The contradiction between our 
findings indicates both GroEL/ES and DnaK/DnaJ/GrpE 
contribute to protein folding and increase the level of functional 
protein produced but with their own specificity on different 
proteins. 



In fact, the fidelity of protein synthesis is sensitive to changes in 
magnesium concentration and at a high Mg^^ concentration, 
ribosomes might become stuck at a translocation step [29] . In our 
case, an increase of only 4 mM Mg^"*" in the commercialized 
PURE system reduced the yield of functional Flue by 70% 
(Figwe 4b). Using our optimized system dramatically altered the 
picture: when Mg^"*" concentration was increased by 4 mM, the 
yield of functional Flue was not reduced (Figure 4c). 

Our finding that the crowding agent BSA can significandy 
enhance PURE system productivity is not surprising since 
transcriptions and translations in vivo occur in macromolecular 
crowding environments. Adding BSA enhanced the association of 
biomolecules due to its excluded volume effect, which increased 
the effective concentrations of the enzymes and biomolecular 
reactants [40,41], and so altered the rates and equilibrium 
constants of their reactions [42]. In addition, BSA as a crowding 
agent, also increased the solution viscosity, thus could dramatically 
reduce the diffusion coefficients of biomolecules by factors up to 10 
fold [43] . Our observation was in agreement with other studies on 
macromolecular crowding effects on biochemical reactions 
involving DNA and protein association. For example, DNA 
replication [44-46], ligation [47], PGR [48], restriction digestion 
[49], nuclease degradation [50] and transcription [30] all can be 
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facilitated by crowding agents because crowding agents can 
dramatically increase the association between enzymes and DNA. 
Here we were able to move a step further by showing that coupled 
transcription and translation can be significantly boosted by BSA 
as a whole, whereas a previous study instead used a two-stage 
system in which macromolecular crowding was used to enhance a 
transcription reaction, after which the mRNA was purified and 
transferred to a second translation reaction [30]. We also found 
that adding PEG-6000 rather than BSA reduced rather than 
increased productivity, presumably by causing proteins to 
aggregate. 

Mimicking the intracellular environment of living cells has 
produced significant improvements in PURE system expression; 
however, there are some new factors that remain to be addressed. 
A larger parameter space of the concentrations of other factors 
present in the PURE system, such as tRNA synthetases, can also 
be tested. The recentiy identified Elongation factor P can prevent 
ribosome from stalling during synthesis of proteins containing 
consecutive prolines, such as PPG, PPP, or longer proline strings, 
in natural and engineered model proteins [31,51]. We observed 
that the PURE system produces half translated products (Figure 
SIO) most likely due to ribosome stalling on mRNA. Therefore, 
besides EF-P, there might be other unidentified factors also 
helping rescue stalled ribosomes. And all of these factors can serve 
as supplements for PURE system translation. Finally, it is also 
worth pointing out that the elficiency of CFPS is also limited by 
energy supply and inhibitory by-products generated from trans- 
lation process [52,53]. This problem can possibly be resolved by a 
continuous-exchange PURE system. 

In summary, we believe our improved PURE system is an 
attractive platform for in vitro protein synthesis. It produces more 
than 5 fold active Flue compared with the original system, and our 
work makes the PURE system more cost-efficient with a 
comparable productivity and cost level to the E. coli crude extract 
CFPS. Our results will have profound implications for systems and 
synthetic biology by enabling better reproducibility of gene 
transcription and translation process in an in vitro setting. Finally, 
by reassembling the "central-dogma" pathway of molecular 
biology with purified components, our system can serve as a basis 
for construction of a minimal protein based self-replicating system 
[54]. 

Materials and Methods 

Media, chemicals, and reagents 

Unless specified, all chemicals were obtained from Sigma- 
Aldrich. PEG-6000 (molecular weight: 6 kDa) was purchased from 
Fisher (Pittsburg, PA). A oO'l-i, (w/v) stock solution of PEG-6000 
was prepared in nuclease-free water before being used in 
experiments. BSA was purchased from New England Biolabs. 
GroEL/ES were obtained from Takara; DnaJ was obtained from 
Accurate Chemical & Scientific Corporation, DnaK and GrpE 
were obtained from Novus Biologicals. Tryptone and yeast extract 
were obtained from BD Difco. For protein purification, liquid 
cultures of all strains were grown in SB media (24 g/L tryptone, 
12 g/L yeast extract, 5 g/L glucose, 2 g/L NaH2P04, 16.4 g/L 
K2HPO4-3H2O, 4 mL/L glycerol). Protein purifications were 
carried out with AKTAprime (GE Healthcare) equipped with 
5 mL HisTrap HP column (GE Healthcare). Purified protein 
concentrations were determined by standard Bradford assay (Bio- 
Rad). 



Molecular cloning 

The Flue gene was cloned into the Ncol and Xhol restriction 
sites of pIVEX-2.3d (5PRIME). MCherry and lacZ were cloned 
into the Ndel and Xhol restriction sites of pET-24b (Novagen). 
PIVEX 2.3d-Fluc, pET-24b mCherry and pET-24b lacZ were 
purified with phenol: chloroform: isoamyl alcohol extraction and 
ethanol precipitation before being used for in vitro transcription. 
EF-4 gene (lepA) sequence was amplified by PCR with primers 
5'GGAATTCCATATGAAGAATATACGTAACTTTTC- 
GAT3' and 5'CCGCTCGAGTTAGTGATGGTGATGGT- 
GATGTTTGTTGTCTTTGCCGACGTG3' from E. coli 
MG1655 genomic DNA (ATCC) and tiie PCR products were 
inserted into the Ndel and Xhol restrictions site of pET-24b. 
Plasmids encoding IFl, 2, 3, EF-Tu, Ts, G, RFl, 2, 3 and RRF 
were provided by T. Ueda. The vectors pLGl and pLG2 (with 
and without a stop codon, respectively) for expressing HaloTagged 
proteins were derived from pFN18K (Promega). Protein genes 
were inserted into a polylinker region between the sequences of an 
N-terminal HaloTag domain and a C-terminal 171-amino-acid 
alpha-helical spacer excised from E. coli TolA domain II. 
Specifically, the gene of mCherry was inserted by using by Ndel 
and Sad restriction sites and the genes of £. coli fusion proteins, 
EntE-mCherry and EntF-mCherry, were inserted by using Xhol 
and Sad restriction sites. Linear DNA templates encoding 
HaloTagged proteins were generated by PCR, purified and then 
used for PURE system reactions. PCR products started from T7 
promoter and ended right before or at stop codon of the 
HaloTagged proteins. 

Over-expression and purification of IFl, 2, 3, EF-Tu, Ts, G, 
RFl, 2, 3, RRF and EF4 

IFl, 2, 3, EF-Tu, Ts, G, RFl, 2, 3 and RRF were over- 
expressed and purified as described in [20] and [32]. PET-24b EF- 
4 were transformed to NEB BL21 (DE3) (New England Biolab) 
strain. Cells were grown in 300 mL SB with 50 |J,g/mL kanamycin 
at 37 °C, induced with 1 mM IPTG when ODgoo reached 0.5, 
and further incubated at 37 °C for 4 hours before harvest. Cell 
paste was lysed by 1 mL BugBuster 10 x Protein Extraction 
Reagent (EMD Chemical), with 100 |a,L Halt-protease inhibitor 
(Thermo-Fisher), 6 mM P-ME, and 20 mM imidazole-HOAc 
(pH 7.4). Cell debris was removed by centrifugation at 1 50,000 x g 
for 25 min twice. The supernatant was loaded to a Ni-NTA 
column and washed with wash buffer (20 mM Tris-HOAc, 
pH 7.6, 30 mM NH4CI, 150 mM KCl, and 150 mM NaCl) 
containing 20 mM Imiazole-HOAc (pH 7.4). The His-tagged EF- 
4 was eluted with a linear gradient from 20 mM to 400 mM 
Imidazole-HOAc in wash buffer, pooled, concentrated with 
Amicon-Ultra-4 concentrator with 3 K MWCO, and dialyzed 
against 1 L of stock bufier (20 mM Tris-HOAc, pH 7.6, 30 mM 
NH4CI, 150 mM KCl, 15 mM Mg(OAc)2, 6 mM PME, and 
10 |tM GDP) for 3 hours twice. Purified EF-4 was added to 20% 
glycerol and stored at — 80°C. 

PURE transcription and translation optimization 

PURE system kits were purchased from New England Biolabs. 
25 |J,L reactions were carried out following the PURE system 
manual with 10 |J.L solution A, 7.5 |iL solution B, 0.8 U/|J.L 
Murine Rnase Inhibitor (New England Biolabs), 10 |J,g/mL 
pFVEX 2.3d-Fluc, or 5 ng/mL pET-24b mCherry or 3 ^lg/mL 
pET-24b lacZ and proper amount of supplement factors as 
indicated in the text. Reactions were incubated at 37 °C for 
2 hours. Flue activity was measured by Promega Luciferase Assay 
System kit. P-gal activity was measured by Galacto-Light Plus P- 
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Galactosidase Reporter Gene Assay System (Life Technologies). 
Chemical luminescence in relative luminescent units (RLUs) and 
fluorescence in relative fluorescence units (RFU) were measured 
by a microplate reader (SpectraMax M5, Molecular Devices). 

mRNA quantification 

Afl;er the reaction, the DNA template was removed by digestion 
with 0.5 lol DNase I (>2,500 U/ml, Thermo scientific) at 37°C for 
15 min. Total mRNA was then quantified with the Quant-iT 
RiboGreen RNA Reagent and Kit (Invitrogen, CA) according to 
the manufacturer's instructions. 

Labelling of in vitro translated proteins with a fluorescent 
HaloTag ligand 

To further examine fuU-length and incomplete products of in 
vitro translation in PURE system, three N-terminal HaloTagged 
recombinant proteins of various sizes were in vitro translated and 
labelled with a fluorescent TMR reporter. To answer if the 
ribosome is recycled during the translation, two expression 
constructs with or without a stop codon were prc'parcd for each 
protein. Lin(-ar DNA templates were used for PURE system 
reactions. Reactions were incubated at 37°C for 2 hours, and 
translated proteins were incubated with 5 |a,M Halo-TMR 
(Promega) in a 1 x PBS buffer at room temperature for 30 min 
prior to SDS-PAGE analysis. The gel was scanned at 580 nm 
(excitation at 532 nm) by Typhoon Trio Imager (GE Healthcare) 
and a fluorescence gel image was analyzed with ImageQuant TL 
software. 

Supporting Information 

Figure SI Assessment of purified C-terminal His-tagged EF4. 

Bcn( liMark Protein Ladder (Life Technologies), E. coli cell lysate 
overexpressing C-terminal His-tagged EF4, flowthrough and 
eluted fractions of EF4 after Ni-NTA purifK;ation were analyzed 
on 4-12% Bis-Tris PAGE gel, stained by Coomassie-blue. EF4 
with a MW of 66.57 kD migrated as expected. 
(TIF) 

Figure S2 Optimization of PURE system as measured by active 
mCherry produced by supplementing different concentrations of 
EF-Tu, Ts, G; EF4; RFl, 2, 3 and RRF. (a). Active mCherry 
produced at different EF-Ts, Tu and G concentrations. The table 
below shows the actual concentration increase of EF-Ts, Tu and G 
in the PURE system, (b). Active mCherry produced at difierent 
EF4 concentrations, (c). Active mCherry produced at difierent 
RFl, 2, 3 and RRF concentrations. The table below shows the 
actual concentration increase of RFl, 2, 3 and RRF in the PURE 
system. MCherry activities were measured by relati\c fluorescence 
unit and PURE system reaction without supplement was set as 
control. Error bars are ± standard deviations, with n = 3. 
(TIF) 

Figure S3 Optimization of PURE system as measured by active 
P-gal produced by supplementing different concentrations of EF- 
Tu, Ts, G; EF4; RFl, 2, 3 and RRF. (a). Active P-gal produced at 
different EF-Ts, Tu and G concentrations. The table below shows 
the actual concentration increase of EF-Ts, Tu and G in the 
PURE system, (b). Active P-gal produced at different EF4 
concentrations, (c). Active P-gal produced at different RFl, 2, 3 
and RRF concentrations. The table below shows the actual 
concentration increase of RFl, 2, 3 and RRF in the PURE system. 
P-gal activities were measured in relative luminescence unit by 
Galacto-Light Plus P-Galactosidase Reporter Gene Assay System 
(Life Technologies) and PURE system reaction without supple- 



ment was set as control. Error bars are ± standard deviations, with 

n = 3. 

(TIF) 

Figure S4 MCherry and P-gal synthesis in the PURE system 
with BSA as a macromolecular crowding agent, (a) Active 
mCherry produced at different concentrations of BSA. MCherry 

activities were measured in relati\'(^ fluorescence unit, (b) Active P- 
gal produced at different concentrations of BSA. P-gal activities 
were measured in relative luminescence unit by Galacto-Light Plus 
P-Galactosidase Reporter Gene Assay System (Life Technologies). 
PURE system reaction without supplement was set as control. 
Error bars are ± standard deviations, with n = 3. 
(TIF) 

Figure S5 Assessment of Flue yield at different concentrations of 
GroEL/ES in PURE system. PIVEX 2.3d-Fluc was added to 
PURE system reaction mixture with different concentrations (from 
control 0 \iM to 6 \lM) of GroEL/ES. After 2 hours incubation at 
37°C, each reaction was analyzed directly on 4—12% Bis-Tris 
PAGE gel, stained by Coomassie-blue. The expected migration 
bands of GroEL/ES and firefly luciferase are marked on the gel. 
(TIF) 

Figure S6 MCherry synthesis in the PURE system with 
chaperone systems GroEL/ES and DnaK/DnaJ/GrpE. (a). 
Active mCherry produced at different GroEL/GroES concentra- 
tions, (b). Active mCherry produced at different DnaK/DnaJ/ 
GrpE concentrations. MCherry activities were measured in 
relative fluorescence unit and PURE system reaction without 
supplement was set as control. Error bars are ± standard 
deviations, with n = 3. 
(TIF) 

Figure S7 P-gal synthesis in the PURE system with chaperone 
systems GroEL/ES and DnaK/DnaJ/GrpE. (a). Active P-gal 
producxxl at different GroEL/GroES concentrations, (b). Active P- 
gal produced at different DnaK/DnaJ/GrpE concentrations. P- 
gal activities were measured in relative luminescence unit by 
Galacto-Light Plus P-Galactosidase Reporter Gene Assay System 
(Life Technologies) and PURE system reaction without supple- 
ment was set as control. Error bars are ± standard deviations, with 
n = 3. 
(TIF) 

Figure S8 MCherry and P-gal synthesis in the PURE system 
with their best combination of enzyme factor concentrations and 
two difierent tRNA concentrations, (a) Active mCherry produced 
with the best combination of EF-Tu, Ts, G; EF4; RRF, RFl, RF2, 
RF3 and BSA concentrations and two different tRNA concentra- 
tions. MCherry activities were measured in relative fluorescence 
units, (b) Active P-gal produced with the best combination of EF- 
Tu, Ts, G; EF4; RRF, RFl, RF2, RF3; GroEL/ES and BSA 
concentrations and two difierent tRNA concentrations. P-gal 
activities were measured in relative luminescence unit by Galacto- 
Light Plus P-Galactosidase Reporter Gene Assay System (Life 
Technologies). Error bars are ± standard deviations, with n = 3. 
(TIF) 

Figure S9 Optimization of PURE system as measured by 
functional Flue produced by addingdifferent concentrations of 
RRF, EF-G, IFl, 2 and 3. (a). Active Flue produced at different 
RRF, EF-G and IF3 concentrations. The table below shows the 
actual concentration increase of RRF, EF-G and IF3 in PURE 
system, (b). Active Flue produced at different IFl, 2 and 3 
concentrations. The table below shows the actual concentration 
increase of IFl, 2 and 3 in PURE system. Flue activities were 
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measured in relative luminescence unit by luciferase assay and 
PURE system reaction without supplement was set as control. 
Error bars are ± standard deviations, with n = 3. 
(TIF) 

Figure SIO Assessment of PURE system translation by produc- 
tion of HaloTag fusion proteins using linear DNA templates. Left 
panel shows the three HaloTag fusion proteins with and without 
stop codon (90 kD, 160 kD, 220 kD) expressed in PURE system 
on SDS-PAGE gel, stained by Coomassie-blue. The right panel 
shows the same gel but with samples incubated with HsdoTag 
TMR ligand. The gel was scanned by a typhoon scanner with 
filter set (555 nmEx/580 nmEm). Therefore half-translated prod- 
ucts can be shown via scanning on the gel. Lane 1 is protein 
marker. Lane 2, 4, 6 are HaloTag fusion proteins without stop 
codon. Lane 3, 5, 7 are HaloTag fusion proteins with stop codon. 
Lane 8 is a negative control with no DNA template. (HT: 
HaloTag; Ch: mCherry. TolA is a C-terminal 171-amino-acid 
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